Machine Learning Methods for Personalized Email Prioritization Ph.D. Thesis

نویسندگان

  • Shinjae Yoo
  • Jaime Carbonell
  • Jamie Callan
چکیده

Email is one of the most prevalent communication tools today, and solving the email overload problem is pressingly urgent. A good way to alleviate email overload is to automatically prioritize received messages f1ording to the priorities of each user. However, research on statistical learning methods for fully personalized email prioritization has been sparse due to privacy issues, since people are reluctant to share personal messages and priority judgments with the research community. It is therefore important to develop and evaluate personalized email prioritization methods under the assumption that only limited training examples can be available, and that the system can only have the personal email data of each user during the training and testing of the model for that user. We focus on three aspects: 1) we investigate how to express the ordinal relations among the priority levels through classification and regression. 2) we analyze personal social networks to capture user groups and to obtain rich features that represent the social roles from the viewpoint of a particular user. 3) We also developed a semi-supervised (transductive) learning algorithm that propagates importance labels from training examples to test examples through messages and user nodes in a personal email network. These methods together enable us to obtain both a better modeling priority and an enriched vector representation of each new email message. Our contribution is as follows. First, we have successfully collected multiple users’ private email data with their fine grained personal priority labels. Second, we apply and propose learning approaches from multi-type information such as text, and sender / recipients information. Third, to supplement additional information to sparse training data, we identify the importance of a contact and similar contacts from social networks. Fourth, we exploit a semi-supervised learning on the personal email networks. Finally, we conducted and completed systematic evaluations with respect to email prioritization, targeting the discovery of better modeling of email priorities. Through our suggested approaches, email prioritization alleviates email glut and should help our daily productivity.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mining Social Networks for Personalized Email

Email is one of the most prevalent communication tools today, and solving the email overload problem is pressingly urgent. A good way to alleviate email overload is to automatically prioritize received messages according to the priorities of each user. However, research on statistical learning methods for fully personalized email prioritization (PEP) has been sparse due to privacy issues, since...

متن کامل

Human Language Technology Conference of the North American Chapter of the Association of Computational Linguistics Proceedings of the Main Conference

Email is the number one activity that people do on the internet: 74% of internet users check their email on an average day. Email use in offices has more than doubled since 2000, and is now over 8 hours a week. There are many great NLP problems for email, like automatic clustering and foldering, search, prioritization, automatically finding keywords within messages, finding addresses, and summa...

متن کامل

MASTER ’ S THESIS Michal Novák Machine Learning Approach to Anaphora

2010 I dedicate this thesis to my family, especially to my mother, who supports me and encourages me throughout my whole life. I would like to thank my supervisor, Ing. Zdeněk Žabokrtský, Ph.D., for his patience and for his valuable expert advices. Moreover, I really appreciate that my supervisor and one of my friends, Mgr. Pavol Rusnák, provided me with a technical support for development and ...

متن کامل

Game theoretical solution concepts for learning agents with extensive–form games

My Ph.D thesis focuses on the study of solution concepts for rational learning agents in extensive–form games in absence of common knowledge; specifically, on the definition of solution concepts, their search, analysis of static and dynamic property, characterization of learning dynamics. Summarily, my work is finalized to better understand how to integrate more thoroughly game theory and machi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2009